Recurrent neural networks (RNNs) have shown promising performance for language modeling. However, traditional training of RNNs using back-propagation through time often suffers from overfitting. One reason for this is that stochastic optimization (used for large training sets) does not provide good estimates of model uncertainty. This paper leverages recent advances in stochastic gradient Markov Chain Monte Carlo (also appropriate for large training sets) to learn weight uncertainty in RNNs. The result is a principled Bayesian learning algorithm that adds gradient noise during training (enhancing exploration of the model-parameter space) and averages over models at test time. Extensive experiments on various RNN models and across a broad range of applications demonstrate the superiority of the proposed approach over stochastic optimization.
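For concreteness, the "gradient noise during training, model averaging when testing" recipe can be illustrated with stochastic gradient Langevin dynamics (SGLD), the simplest SG-MCMC sampler. The sketch below is a minimal illustration, not the paper's exact algorithm: it assumes PyTorch, and the function names and hyperparameters are hypothetical.

```python
import torch

@torch.no_grad()
def sgld_step(params, lr):
    """One SGLD update (Welling & Teh, 2011): a half-step of SGD on the
    negative log posterior plus Gaussian noise whose variance equals the
    step size. Assumes each .grad already holds a stochastic gradient of
    the negative log posterior, rescaled to the full training set."""
    for p in params:
        if p.grad is None:
            continue
        # Injected noise ~ N(0, lr) drives exploration of the parameter space.
        p.add_(-0.5 * lr * p.grad + lr ** 0.5 * torch.randn_like(p))

@torch.no_grad()
def averaged_prediction(snapshots, x):
    """Test-time model averaging: average the predictive distributions of
    weight samples (model snapshots) collected along the SGLD trajectory."""
    probs = [torch.softmax(model(x), dim=-1) for model in snapshots]
    return torch.stack(probs).mean(dim=0)
```

In this setup, snapshots of the weights saved periodically during the noisy training run serve as approximate posterior samples, and averaging their predictions approximates the Bayesian predictive distribution.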